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We consider the problem of comparing two diagnostic tests 
based on a sample of paired test results without true state de- 
C^> terminations, in cases where the second test can reasonably be 

assumed to be at least as specific as the first. For such cases, 
j~j we provide two informative confidence bounds: A lower one for 

the prevalence times the sensitivity gain of the second test with 
respect to the first, and an upper one for the sensitivity of the 
first test. Neither conditional independence of the two tests nor 
perfectness of any of them needs to be assumed, 
i— i An application of the proposed confidence bounds to a sam- 

Ph pie of 256 pairs of laboratory test results for toxigenic Clostrid- 

ium difficile provides evidence for a dramatic sensitivity gain 
through first appropriately culturing Clostridium difficile from 
stool samples before applying an enzyme-immuno-assay. 

CZ2 
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> 

^ 1. Main results and applications. 

\o 

1.1. Introduction and outline. Inference for sensitivities or specificities of 
diagnostic tests can be next to impossible if no suitable method for deter- 
mining true states is available. Motivated by a real data problem described 
i—i below, and in more detail in [61 UJ, we consider here the situation where 

paired observations for two tests are given and where it can be assumed that 
the first test is less specific than the second. Can we then infer from suitable 
observations that the second test is more sensitive, and hence better, than 
the first? And if yes, by how much? 



Theorem 1.1 in Subsection 1.6 below provides a simple and in some sense 



optimal answer. The necessary notation and concepts are carefully explained 
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before in Subsections 1.2 1.4, but some readers may wish to start less formally 
by first consulting Subsection |1.5[ which introduces our motivating example, 



and then proceed to the application of Theorem Ll_ given immediately after 
its statement. There it turns out that the answer to the above "how much?" 
question depends on upper bounds assumed for the prevalence, but that nev- 
ertheless interesting upper bounds for the sensitivity of the first test can be 
given without such an assumption, using our Theorems 1.2 and 1.3 Neither 
of our results uses any further assumptions, such as the conditional indepen- 
dence assumption as discussed and criticized, for example, in Section 7.3]. 
We prove Theorems [TTT L3 in the final Section [4j after collecting auxiliary 



results on latent class models in Section [2] and proving them in Section |3j 

While there is a substantial literature on various aspects of the statistics of 
diagnostic tests, see in particular the monographs [H El E] , we are not aware 
of a previous treatment of the problem considered here. Our assumption that 
the first test is less specific than the second may seem very special, so let us 
point out that, for the purpose of obtaining upper bounds on the sensitivity 
of the first test, our assumption may by Theorem L3 replace the always less 
plausible assumption of perfectness of the second test, see Subsection 1 1 . 8| for 
an example. 



1.2. Mathematical and probabilistic notation and conventions. We use "iff" 
as an abbreviation for "if and only if". We write N := {1, 2,3, . . .}, No := 
{0} U N, and R := R U {-oo, oo}. We put x/0 := oo for x > 0, but we define 
0/0 below at each occurence separately to be either or 1/2 or 1. A sub- 
script "+" indicates summation with respect to the variable it replaces, as in 

x + = Yh=i x i f° r x e ^™ or m © below for k G Nq 0,1 ^ . By contrast, a super- 
script "+" indicates the positive part, so x + = x V = max{i, 0} and corre- 
spondingly x' = (—x) V0 for i 6 I. As usual, the order theoretic operations 
A and V are computed first in expressions like abAc := a {b Ac) = a min{6, c}. 
If X and y are any sets, then 

prob(A') := {(X3x^p x G [0,1]) : ^ P:t = l} 

mark(^, y) := {{X x y 3 (x, y) ^ p y]x ) : p. ]x e prob(^) for x G X} 

denote the set of all discrete probability densities on X and the set of all 
discrete Markov transition densities from X to y, where the standard dot 
notation p,\ x for the partial function y i-> p y \ x has been used. With M np 
we denote the multinomial distribution with sample size parameter n and 
success probability vector p G prob(A') for some X, that is, M„ iP ({&;}) = 
n\ HxeAp k x x /kJ) for k G with £ x . e * k x = n. 
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1.3. Confidence bounds and their comparison. Let V = (P$ : •& G 0) be 
a statistical model on a sample space X and let k : — » R be a parameter 
of interest. We allow nonidentifiability of k, that is, we may have $i,$2 £ 
with Ptfj = P# 2 but 7^ ^($2)- For lack of any better name, let us call 
the pair (V, k) an estimation problem. Let G [0, 1]. Then every measurable 
function k : X — > R with P$(k < > /? for every t? G is called a lower 
^-confidence bound for (V,k). 

Now let ft and g be both lower /3-confidence bounds for (V, k). Then ev- 
erybody seems to agree that for preferring k over k, it would be desirable to 
have 

(1) P${K>t) < P#(K>t) for d G and t < «(#) 

For example, Lehmann and Romano [3[ page 72] would call k uniformly 
most accurate if ([I]) held for every k as above, but such a k is known to exist 

in exceptional cases only. The desideratum ([T]) could be supplemented by 
conditions for t > in different ways, see [9J page 162] for one possibility, 
but we stick to ([T]) as it is. Thus we call k worse than k, and equivalently 

k better than k, if (IT]) holds, and strictly so, if in addition strict inequality 

holds in Q for at least one ^ and one t. Accordingly, k is called admissible as 
a /3-confidence bound for (V, k), if no other such bound k is strictly better. 

Finally, k and k are called equivalent, if each is worse than the other, that is, 

if Q holds with "=" in place of "<". 

1.4. Latent class models for diagnostic tests. Informally speaking, a (di- 
chotomous) diagnostic test is a procedure yielding a guess G {0, 1} for the 
state G {0, 1} of any item belonging to some specified population. In this con- 
text, is called negative and 1 is called positive. In medicine, the population 
often consists of persons, for whom a positive state means actually having 
a certain disease, and a positive diagnosis means to be guessed to have the 
disease. The accuracy of a diagnostic test is modelled by two numbers called 
specificity and sensitivity, with specificity interpreted as the probability that 
a random negative item is diagnosed as negative, and sensitivity as the prob- 
ability that a random positive item is diagnosed as positive. The probability 
of diagnosing a random item from the whole population as positive, say, then 
of course depends also on the prevalence, which is the probability of such an 
item to be actually positive. If we formalize the above, for samples of size n 
rather than 1, and also admitting more generally d tests, rather than just 
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one, to be applied to every item, we arrive at the following model considered 
in essence already in [2]. 
Let d G N and 

Q d : = prob({0,l}) x mark({0,l},{0,l} d ) 

For $ = (vr,x) G 0^, let G prob({0, l} d ) denote the second marginal of 
the density 

(2) {0,l}x{0,l} d 3(i,j)^7t iXjli 
so that 

(3) = forjG{0,l} d andtf G0 d 
3 i=0 

Finally, with a given n G N often notationally surpressed in what follows, let 

(4) P# := M ft , MW for^G©, 

Definition 1.1. Let d,n e N. T/ie (full) latent class model /or a sample 
of size n of combined results of d diagnostic tests with unknown characteris- 
tics and for a state with unknown prevalence is Vd '■= {P& '■ $ G 0^). 

The interpretation of the parameter i9 = (jr, x) in this model is as follows: 
7Ti is the prevalence of positive states and x is the joint characteristics of the 
d diagnostic tests. 

For example, let d = 2. Then Xoilo * s t ne probability that a random negative 
(see the last bit of the subscript) is diagnosed negative by the first test (see 
the first bit of the subscript) and positive by the second (see the second bit of 
the subscript). And Xo+|o = Aoo|o + Xoi\o * s then accordingly the probability 
that a random negative is diagnosed negative by the first test, that is, the 
specificity of the first test. More systematically, and introducing a notation 
used below, we put 

(5) x^i ■■= x l+ \i and x ( $ ■■= x+ t \i for h t e {o, 1} 

and regard x > X £ mark({0, 1}, {0, 1}) as the characteristics of the first 
and of the second test, respectively. 

Coming back to general d, formula ^ gives the joint density of a true state 
determination together with the results of the d tests, for an item picked at 
random from the whole population, and is the marginal density corre- 
sponding to unobservability of the true state. Finally, the multinomial dis- 
tribution Ptf = M nj/t (0) models testing thus a random sample of size n from 
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the (conceptually infinite) population, and counting just the number of oc- 
curences of each possible combination of the d test results. 



In this paper, motivated by the application sketched in Subsection |1.5 
below, we are mainly interested in the case of d = 2, and here in particular in 
the submodel assuming that the specificity of the first test is at most equal 
to the specificity of the second. In terms of the parameter $ = {it, x) £ O2 
and with the notation introduced in ([5j above, this assumption is expressed 

(1) / (2) 

as Xo\o S Xo|o- 

Definition 1.2. Let n 6 N. In this paper, the restricted latent class 
model for a sample of size n of combined results of two diagnostic tests with 
unknown characteristics and for a state with unknown prevalence is V2,< '■ = 

(p^.tfe e 2> <) with e 2 ,< := {(7T, X ) e e 2 : x jg < 

1.5. Example: A comparison of two tests for diagnosing toxigenic Clostrid- 
ium difficile. Clostridium difficile is a certain species of bacteria. Some of 
these, called toxigenic, have the potential to produce one or both of certain 
toxins, called A and B. Toxigenic Clostridium difficile is responsible for one 
of the most prevalent infections of the human gut. It may lead to severe 
courses of infection and is easily transmitted in hospitals. A fast and accu- 
rate diagnosis would be highly desirable for initiating adequate therapy and 
preventing transmissions to other patients. Unfortunately, so far no diagnos- 
tic test, not even a complex and time-consuming one, has been proven to be 
highly accurate, that is, with specificity and sensitivity close to 1. 

Available diagnostic tests are applied to stool specimens of patients with 
diarrhoea, using one of the following three methods, with details to be spec- 
ified. The first, simple and a matter of a few hours, consists in performing 
an enzyme-immuno-assay {EIA) for the direct detection of toxin A or B in 
the stool specimen. The second, taking about 3 days, consists in trying to 
culture Clostridium difficile (possibly nontoxigenic) from the stool specimen 
on an appropriate medium and applying then a "confirmatory test" for toxin 
A or B, for example an EIA as above, to any cultured colonies. The third, 
again taking about 3 days, tests the cytotoxicital potential of the stool spec- 
imen by applying it to a vero-cell culture {cytotoxicity neutralisation test). 
For several such tests, different accuracy values were published during the 
last years, often obtained by assuming the cytotoxicity neutralisation test 
to be a sufficiently accurate reference test or "gold standard", see j6j [7] for 
appropriate references. 

One goal of jfH E] was to compare a test according to the first method 
described above {Test 1 or direct test) with a test according to the second 
method, with the confirmatory test being the same EIA as in the direct test 
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(Test 2 or culture test). Both tests were applied to each stool specimen of a 
sample of size 256, consisting of all liquid specimens sent to a microbiological 
laboratory during two consecutive months. The observed data were 



(6) 



^00 


= 210 


^01 = 


20 


^0+ 


= 230 


^10 


= 4 


k n = 


22 


k l+ 


= 26 


^+0 


= 214 


k + i = 


42 


k ++ 


= 256 



where, for example, k 01 is the number of specimens tested negative with Test 1 
and positive with Test 2. True states were unobservable. The prevalence of 
toxigenic Clostridium difficile, in the population of all liquid stool samples 
sent to a laboratory for microbiological investigation, is certainly not known 
precisely, but is believed to be very roughly 15%. So far it seems natural to 
use the full latent class model V2 for analyzing the data. However, as the 
EIA is applied in Test 1 to the whole stool specimen and in Test 2 only 
to a part of a culture from the specimen already identified as Clostridium 
difficile, it seems very plausible to assume that Test 2 is at least as specific 
as Test 1. This suggests that the restricted latent class model V2,< could be 
used, and that then the superiority of Test 2 would follow if the latter can 



be proved to be also more sensitive than Test 1. Theorem 1 1 . 1 1 in the next 
section is formulated with a view towards situations like the present, taking 
into account both models, V2 and T > 2 ) <- 

1.6. Main results. Application to the comparison of tests for diagnosing 
toxigenic Clostridium difficile. 

Theorem 1.1. Let (3 e [0, 1], n e N, 

(7) A : [k e nS 0,1}2 : k ++ = n)^ [-1, 1] 

be a function, and Ai := (M„ i5 : q G prob({0, l} 2 )) be a quadrinomial model. 
A. The following three assertions are equivalent: 

(i) A is a lower (3 - confidence bound in the model Ai and for the parameter 

(8) Q 1 -> %i ~ Qw 

(ii) A is a lower (3-confidence bound in the full latent class model T>2 and 
for the parameter 

(9) (tt.x) h> 7r 1 (xS-xSi)-(l-7r 1 )(xg-xS) 

(Hi) A is a lower /3-confidence bound in the restricted latent class model 
V2,< and for the parameter 

(10) fa*) H- m(x§-X§ 
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B. Let A obey the above conditions (i)-(iii) and let A be another such 
function. Then A is worse than A as a lower (3-confidence bound for (Ai, pi) 



iff it is so for (P 2 , @), and if it is so for (V 2 ,<, ([TO])) - (Once "iff", once "if".) 

C. If A is admissible as a /3-confidence bound for one of the problems 
(M, @) and (V 2 , @), then so it is for the other and for (P 2> <, ^)- 



See Section [4] for a proof of this and the other two theorems of this subsec- 



tion. We proceed to illustrate Theorem 1.1 by its application to the example 



from Subsection 1.5. Let (3 G [0, 1] and n 6 N be fixed. Wanted is a "good" 
confidence bound A as in Q and (iii) above. Parts B and C Theorem |1.1| 
suggest choosing A to be a "good" confidence bound as in (i). We put 



A(fc) := £(koi, fcio, k 00 + kn) for k G Nq 0,1 ^ with k ++ = n 

where I : {k G Nq : k + = n} — > [—1, 1] is the Lloyd-Moldovan lower (3- 
confidence bound for the coordinate difference prob({l, 2, 3}) 3 p i -> p \ — p 2 
in the trinomial model (M n p : p G prob({l, 2, 3})Y see Subsection 1.7 
A satisfies ([7]) and (i). With (3 = 0.95 and the data k from (J6|, we ge ; 



Then 



AO) = £(20,4,232) = 0.0320 

as our lower confidence bound in (i), corresponding to the point estimate 
Mi ~ 2§6 = IE = 0-0625. (Here and below, numbers in typescript like 0.0320 
are rounded consistently with the inequalities claimed.) Thus, assuming the 
restricted latent class model V 2t < and using (iii), we get the confidence state- 
ment 

(ID X^-TCV > 

Ki\i Kx\i ^ ^ 

with 7Ti > 0, so that Test 2 is significantly more sensitive than Test 1 
and hence, being at least as specific by assumption, significantly better. 
Without any upper bound on the prevalence tti, the best lower bound for 
the sensitivity gain Xm ~ Xm °f t ne culture test with respect to the di- 



rect test we can obtain from (11) is 0.0320. But assuming some plausible 
upper bound implies a dramatic sensitivity gain; for example, 7ri < 0.15 
yields Xm. ~~ Xm — 0.0320/0.15 = 0.21. This would imply in particular 

Xi\{ < 1 — (Xm ~ Xm) — 1 — 0.21 = 0.79 and hence a very poor sensitivity 
of the direct test. It is remarkable that the latter conclusion, with a slightly 
larger bound, can be obtained without any assumption on the prevalence by 



using the following theorems, see (17) and (19) below. 
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Theorem 1.2. Let /3 G [0, 1], n G N, 
(12) 5:{fcG Nj°' 1}2 : £;++ = n} -)• [0, 1] 



6e a function, and M. := (M n)? : g G prob({0, l} 2 )) &e a quadrinomial model. 

A. 5 is an upper (3-confidence bound in the model Ai and for the parameter 

(13) q H- -^^vf ^ 911 Al) uift °:=1 

iff it is so in the restricted latent class model 7\< one? for the parameter 

(14) (ti, X) » X { Z 

B. Let S obey the equivalent conditions from part A, and let S be an- 
other such function. If S is worse than S as an upper (3-confidence bound for 



(P 2j <, (14)), then so it is for (A4,(13)) 



C. If S is admissible as a (3-confidence bound for (Ai, (13)), then so it is 



for (P 2 ,<,(U)). 



We get a confidence bound for (A4,(13)), as needed for applying Theo- 
rem |1.2| A, from confidence bounds in certain trinomial models, similarly to 



but slightly less obviously than for the situation of Theorem 1.1 



Theorem 1.3. Let /3 G [0, 1], n G N, and u : {k e Nq : k + < n} -> [0, oo] 
be a function such that, for every m G {0,...,n}, the restriction of u to 
{k G Nq : k + = m} is an upper /3-confidence bound in the trinomial model 
(M m p : p G prob({l, 2, 3})) and for the parameter 



(15) p \-> (l-pa)V 

Then the function 



1 - Pi - P2 
(l-2pi) + 



A 1 



with 



(16) 



[k G Nj°' 1}2 : k ++ = n] 3 k ^ u{k 10 , k 01 , k n ) 



is an upper /3-confidence bound for (V2,<, (14)). 



As we are not aware of a function u as assumed in Theorem 11.31 and 
also well-founded and easily available for practical computation, we use here 
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the following ad hoc method: Let Uq denote the Lloyd-Moldovan upper (5- 
confidence bound corresponding to the lower bound £ used above. Then, since 



R.H.S.© = (1-riV '-f ^'/ m with °, 
< (l+ Pl -p 2 )Al 
for p G prob({l, 2, 3}), we may take u :— (1 + Wo)Al in Theorem 1.3 Applied 



to our data (|6]), this yields u(k w ,k i, k u ) = (1 + u Q (4, 20, 22)) A 1 = 0.83 and 
thus 

(17) xs;; < o.83 

with confidence 0.95, in the restricted latent class model without further 
assumptions. 



Going back to (11 ), obtained under the restricted latent class model, Part A 



of Theorem 1.1 suggests that we should perhaps rather state 



C\R\ J® > °- 032Q i 1 ~ 7Yl (J?) J 1 ) 
Xi|i — Xi|i > — — 1 — V x o|o — X |o 

as a valid confidence statement under the full latent class model. This not 
only makes obvious the effect of the possibility Xo|o — X }o < i n the larger 
model, drastically decreasing the lower bound for the sensitivity difference, 
but also the possibly drastic increase if we actually have x^o — Xo\l > and 
7Tx rather small. 

So far, we have for simplicity only considered part of the data from [6j [7]. 
There, we actually applied the direct test and three versions of the culture 
test, differing in the culture media used, to each of the 256 specimens. The 
media are called I, II, III in [6], [7j, and here ^ presents just the results for 
the direct test and for the culture test with medium II. Bounds analogous to 
the above lower confidence bound for the sensitivity gain through culturing 
with medium II, with the exemplary assumption 7Ti < 0.15, were computed 
for media I and III, resulting in —0.04 for I (so no statistically significant 
gain here) and 0.02 for III. For obtaining the upper confidence bound on the 
sensitivity of the direct test, without any assumption on the prevalence, we 
compared in [7J the direct test with the logical oring of the three culture 
tests, which diagnoses a specimen as positive if at least one of the three does 
so, yielding the data k 00 = 209, k i = 21, k w = 4, ku = 22 rather than (|6]), 
and hence the confidence statement 

(19) XiJJ < (1 + ^(4,21,22)) A 1 = 0.81 
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1.7. The Lloyd- Moldovan confidence bound for a coordinate difference of 
a multinomial parameter. The best currently available confidence bound I 
as needed in Theorem |1.1| appears to be the one proposed and implemented 
by Lloyd and Moldovan [5]: To compute it, load their program into R with 
load("sm_file_SIM2708_2"), type bcl(cl . side=-l) , where asks for 
the lower rather than the default upper bound obtainable with just bcl(), 
enter the three numbers x = ki, t = ki + k 2 und n = ki + k 2 + k 3 , with 
return after each, and then a few more returns, assuming here /3 = 0.95 for 
simplicity. 

1.8. Example: Robust upper confidence bounds for the sensitivities of diag- 
nostic tests for coronary artery disease. This subsection uses part of a stan- 
dard dataset, given in jH Table 5] and [U pp. 8, 17, 22] and drawn from [10J, 
to exemplify the final sentence of Subsection We consider evaluating two 
diagnostic tests for coronary artery disease (CAD). This disease is the most 
frequent cause of myocardic infarction, which in turn is the most frequent 
cause of death in developed countries. 

The first test considered is a dichotomized exercise stress test (EST), the 
second a dichotomized chest pain history (CPH). These two tests and a 
dichotomized arteriography (A) were performed on each of 1465 men. The 

dataset is a three-way table of counts k G Nq with k +++ = 1465 and with 
the indexing here corresponding to the ordering EST, CPH, A: k 000 = 151 
men negative for all three tests, fc oi = 25 positive only for A, k ow = 176 
positive only for CPH, fe n = 183, fcioo = 46 positive only for EST, kioi = 
29, fcno = 69, km = 786. As usual, it is assumed that the 1465 trivariate 
observables are independent and identically distributed. Let k EST := (h + j : 
(i, j) G {0, l} 2 ) denote the marginal table for just the results of EST and A, 
and let analogously k cpn := (k +i j : (i, j) G {0, l} 2 ) be the marginal table for 
CPH and A. Thus 

fc E ST = 327 fc*f T = 208 C H = 197 fc T H = 54 

fcf ST = 115 fe^ T = 815 fcf PH = 245 fc<f H = 969 

If, as in [U IB], the test A is assumed to be perfect, then we get the following 
four separate 95% binomial confidence statements (ignoring corrections for 
quadruplicity) for the sensitivities Se and Se and the specificities Sp 
and Sp CPH of the tests EST and CPH, 

(20) 0.770 < Se EST < 0.821 0.931 < Se CPH < 0.961 

0.696 < Sp EST < 0.781 0.398 < Sp CPH < 0.494 

using, for example, the R-command binom. test (c (815, 208)) for the first 
interval, and we may conclude that neither EST nor CPH is sufficiently ac- 
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curate. The perfectness of A means that its specificity Sp A and its sensitivity 
Se A are both equal to 1, or rather practically very nearly so. Here the as- 
sumption Sp A = 1 appears quite reasonable from the medical point of view, 
but Se =1 does not. Using now only the weaker assumption Sp < Sp 
or Sp CPH < Sp A , respectively, we get the two separate 95% upper confidence 
bound statements 



Se EST < 0.945 



Sc 



CPH 



< 1 



using Theorem 1.3 as in Subsection 1.6, computing w(115, 208, 815) and 
m(245, 54, 969) with the ad hoc function u indicated there. The second bound 
is unfortunately trivial, but the first, while of course weaker than the state- 



ment from (20) obtained under a much stronger assumption, is still good 
enough to show that EST is far from perfect: EST fails to diagnose CAD for 
at least every twentieth CAD patient. 

2. Auxiliary results on latent class models. In this section we de- 
scribe images, under various parameters of interest, of the preimage /x -1 ({<?}) 
= {<& = (tt,x) £ @d : A*(i?) = q} in Subsections 2.1 and 2.2, and of a 
similar preimage with G>2,< in place of 0^ in Subsection 2.3, of a given 



q G prob({0, l} d ) under the function /i defined in (J3]). Informally speaking, 
this amounts to determining the exact joint range of the possible values of 
the prevalence, sensitivities, and specificities (Lemma 2.1|for d — 1 and Lem- 
ma 



2.3 for d — 2), or certain functions of these (Lemmas 2.4 2.16), assuming 
the density q of the joint test results as known or, in a more practical in- 
terpretation, estimated with high accuracy from a very large sample of joint 
test results. For example, using here, for the purpose of illustration only, 
q = q := based on the data k from that is 





= 0.820 


$01 


= 0.078 


%+ 


= 0.90 




= 0.016 




= 0.083 




= 0.10 


q+o 


= 0.84 


$+i 


= 0.16 


<?++ 


= 1 



the pictures of C and C< displayed below near the corresponding Lemmas 2.5 



and 2.10 show as hatched regions the exact joint ranges of the possible values 
of the prevalence and the sensitivity difference, the first in the full latent class 
model, and the second in the restricted one. 

All these lemmas, needed to prove Theorems 1A_ and L3 in Section|4]below, 
are proved in Section [3j where the less interesting results of Subsections 2A_ 



and |2.2| are used for obtaining the more important results of Subsection |2.3 
We have found it suggestive to denote below certain "variables" with Pr 
Sp, Se, Sp 1; Sp 2 , Sei, Se2, and ASe. Perhaps it should be pointed out that 
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for example, denoting a variable by Sei in Lemma 2.6 does not imply that Sei 
be the first coordinate of some tuple called Se. This differs from our use of 



subscripts for tt and Xi f° r example in the definition of A in Lemma 2.3 
where 7Ti is understood to be the last coordinate of tt = (tcq, 7Ti). 

2.1. The case d = 1 and a partial reduction to it. In this subsection, we 
write more precisely /i^ for the function \i from (|3]). 

Lemma 2.1. Ifq £ prob({0, 1}), then {(tti, x | > Xi|i) : x) e /^(M)} 
= {(Pr,Sp,Se) £ [0, l] 3 : (1 - Pr) (1 - Sp) + Pr Se = q x }. 



We recall the dot notation for functions explained in Subsection 1.2 
LEMMA 2.2. Let q £ prob({0, l} 2 ). Then 

(21) {(^x il) )--(^x)e^ 1 ({<i})} = th\{q-+}) 

(22) {(*,X {2) )--(K,X)e^ 1 ({q})} = 

2.2. The case d = 2 for the full latent class model. In this subsection and 
in the next one, we return to the shorter notation \i instead of /i 2 , and we 
assume that q £ prob({0, l} 2 ) is fixed. 

Lemma 2.3. A := { fa, x$, Xojo> Xiji) ■ fax) e is tf*e 

nonempty set of all (Pr, Sp x , Sei, Sp 2 , Se 2 ) G [0,1] 5 satisfying the relations 

(23) (l-Pr)(l-S Pl )+PrSei = q 1+ 

(24) (l-Pr)(l-Sp 2 ) + PrSe 2 = q +1 

(25) (1 - Pr) S Pl A Sp 2 + Pr (1 - Sei V Se 2 ) > <7 o 

(26) (l-Pr)(Spi + Sp 2 -l) + + Pr(l-Sei-Se 2 )+ < g 00 

or, equivalently, 

(27) Pr(Se 2 -Sei) = (1 - Pr)(Sp 2 - S Pl ) + q m - q 10 

(28) Pr(Sei + Se 2 -l) = (1 - Pr) (S Pl + Sp 2 - 1) + q n - q 00 

(29) -q w < Pr(Se 2 -Sei) < q m 

(30) -g 00 < Pr(Sei + Se 2 -l) < q u 

LEMMA 2.4. B := { fa,X%X?\i) ■ fax) e /^(M)} is the nonempty 
set of all (Pr, Sei,Se 2 ) £ [0, l] 3 satisfying the relations (|29]), (30) and 

(31) Pr-g 0+ < Pr Sei < q 1+ 

(32) Pr-g+o < PrSe 2 < q +1 
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Lemma 2.5. C := j xf\\ — Xi\i) '■ i 71 , X) £ A* 1 ({?})} ^ s ^ e nonempty 
set of all (Pr, ASe) G [0,1] x [—1,1] satisfying the inequalities 

(33) (-g 10 )V(? 01 -?io + Pr-l) < PrASe 

< 9oi A (?oi ~ ?io + 1 ~ Pr ) 



C : Full range for Pr and ASe, with q =q 



o 









I llllllllllllm Illlllllllllllliiuntiumtmixu* 











0.0 0.2 0.4 0.6 0.8 1.0 

Pr 



Lemma 2.6. D :— j ("71"!, XijiJ : (^jX) e A 4 1 ({?})} ^ s ^ e nonempty set 
of all (Pr,Sei) G [0, l] 2 satisfying p). 



Lemma 2.7. i? := | f^Xi^) : (7r, x) £ A* 1 ({?})} ^ e nonempty set 
of all (Pr, Se 2 ) G [0, l] 2 satisfying d32b. 



Lemma 2.8. 

^ := { (X^-XS) : (^^G/i- 1 ^})} = [-1,1] 

G := {xi}J = (^ X )e^ 1 ({q})} = [0,1] 

# := {x$ : (tt,x)g A*" 1 ({?})} = [0,1] 
7 := : ( 7r , x )G/i- 1 ({g})} = [0,1] 

2.3. The case d = 2 /or A/ie restricted latent class model. We recall that \x 
denotes //2 and that g G prob({0, l} 2 ) is fixed also in this subsection. Here we 
describe images A< to I< analogous to A to /, with G 02, < : = q} in 
place of G 02 : A*^) = <?}• We recall from Definition 1.2 that the subscript 
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"<" indicates that the specificity of the first test is assumed to be at most 
equal to that of the second. Trivially, 



A< 



{ (Ti>Xojo>X?i>Xo|o:XiJi) : ( 7r '^) G / i HM). Xo|o < Xojo} 



is just the set of all (Pr, Sp 1; Sei, Sp 2 , Se2) G ^4 satisfying Sp x < Sp 2 , and the 
nonemptyness of this set is proved at the beginning of Section [3] below. 

Lemma 2.9. B< := { (vr,, x % X?\l) ■ (tt,x) e X$ < xSS} ^ 

t/ie nonempty set of all (Pr, Sei,Se 2 ) G [0, l] 3 satisfying the relations (30), 



(31), (32), and 



(34) 



tfoi - Qw < Pr (Se 2 - Sei) < q m 



Lemma 2.10. C< := { (tt^J - x$) : (vr, a) G ^({g}), < *$} 
is t/ie nonempty set of all (Pr, ASe) G [0, 1] x [— 1, 1] satisfying the inequalities 



(35) 



g ox - g 10 < Pr A Se < q m A (g 01 - g 10 + 1 - Pr) 



C< : Restricted range for Pr and ASe, with q =q 



m 
o 

CD 
< ° 




0.0 



0.2 



0.4 



0.6 



0.8 



1.0 



Pr 



Lemma 2.11. D< := { (tt^x^) : (vr,x) e /i _1 ({g}), Xojo < X$} * s i/ie 
nonempty set of all (Pr, Sei) G [0, l] 2 satisfying the inequalities 

(36) Pr-g 0+ < Pr Sei < g 1+ A ?r + gl + - 9o1 A (Pr + q l0 - g 01 ) 
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*uXi\i) fox) e m x { o\l < x$} is the 

inequalities 



Lemma 2.12. E< := { ^ 

^l' A,l|l I ■ V" 7 A/ t" 

nonempty set of all (Pr, Se 2 ) G [0, l] 2 satisfying the 
(37) (Pr - g +0 ) V Pr + g| - 9+0 V (g 01 - g M ) < PrSe 2 < q +1 

{ - x[\l ■ fo x) G Xofo ^ Xo|d} « t/ie 

9io> !] % f%i ~ 9io > °; anrf [ _1 > x ] »/9oi ~ <?io < °- 

fox) G ^\{q}), Xo|o < X§o} * 



Lemma 2.13. F< := ^ X i\i ~ : ^, 
nonempty interval [q 01 — q w , 1] if q 01 — q w 

Lemma 2.14. G< := {xlfj 
nonempty interval 



is the 



0. 



9ii 



?i+ + 9oi 

, 2.15. 

nonempty interval 



v(e+i-9io)' 



A 1 



with 







Lemma 2.15. ff< := { xj^ : fox) e /* _1 (M), X® 



« < 



<xg} 



goi A (goi ~ 0io) + x 
0o+ + <?io 9o+ - 9io 



with 



° :=1 




LEMMA 2.16. J< := { ^ : (tt,x) G /x ^{g}), Xo}o < X^} * s i/ie nonempty 
interval 

(?oi - 0io) + > (l - (0io - v (0io - ?oi) 

3. Proofs for Section [2l Let us first address the nonemptyness of the 



sets A to J<. Below, we prove Lemmas 2.3 2.16 ignoring the word "nonempty". 



So, strictly speaking, we should rather write something like "Proof of Weak 
Lemma 2.3f and so on below. We next observe that, say, the interval then 



known to equal G< by Lemma 2.14 is nonempty, as it contains zero. Hence 



A< is nonempty, since 67< is the image of A< under some function. Hence 
A D A< is nonempty. Hence the remaining sets are nonempty, as they are 
images of A or A< under certain functions. 

Proof of Lemma l2~Tl If fox) G l^i 1 ({q}), then we have in particular 

fo,X |o>Xi|i) e M 3 and (1 — ^0(1 — X |o) + ^iXiii = (^ifox))j = 0i- 
This shows that "C" holds in the claimed equality. If, conversely, (Pr, Sp, Se) 
belongs to the second set, and if we put 7r := 1 — Pr, it\ := Pr, Xo|o := Sp, 
Xx|o := 1 - Sp, x |i : = 1 - Se > Xi|i ■= Se, then (vr, x) G ^({q})- Thus 'O" 



holds as well. 



□ 
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Proof of Lemma [272J If (n,x) e ^({q}), tnen ( 7r ) X (1) ) e @i and for 
t G {0, 1} we have 



i i 



M?r,x (1) ) = = 53 53 ^x^ = X) = g,+ 

1=0 1=0 K=0 K=0 

This proves "C" in Q. If (V,^) G then, by 

ft 



for (z,j)e{0,l}x{0,l} 2 



with the nonstandard convention | := |, we define ax 6 mark({0, 1}, {0, l} 2 ) 
with x — an d 



q- 1 

Xj\i = —Y.^jAi = ft forjG{0,l} z 

fti+ t=0 



(M^.x)). = 5^7Ti- - — 

i=0 

This proves "D" in (21). The proof of (22) is analogous. 

Proof of Lemma 12.31 Call A' the set claimed to equal A up to line ( [26] ) . 
If $ = (71", x) ^ ©2, then obviously 



□ 



(38) 



K ) Xo+[o>Xi+|i ) X+o[o>X+i|i) e [o,i] E 



and, using ([3]) and identities like n = 1 — n 1 and Xi+|o = 1 — Ao+|0' the 
condition = g is seen to be equivalent to the system of three equations 

(39) 
(40) 
(41) 



(l-7T 1 )(l-X(H.[o)+ 7r lXl+|l = ft+ 
(1 -71^(1 -X+0\o) +7TlX+l|l = 9+1 

(1 - 7Tl)X00[0 + ^lXooji = 9oo 



Using first Xoo|o < X +|o A X +0 |o and Xoo|i < Xo+|i A X+o|i = i 1 ~ Xi+\i) A (1 - 

X+i|i) and second Xoo|o = X 0+ |o + X +0 \o - (1 - Xu|o) > (Xo+|o + X +0 |o ~ 1 ) + 

and Xoo|i = 1 + Xii|i - Xi+\i ~ X+i\i > (1 - Xi+n - we see that 

equation plj ) would imply the two inequalities 



( 1 - 7r i)Xo+|o A X+o|o + 7r i( 1 -Xi+|i) A(l-x+i|i) > 9oo 

(1 - ^i)(Xo+|o + X+o|o - !) + + ^iC 1 - Xi+|i - X+i|i) + < 9oo 

Thus for G 02 with = q, the left hand side of (38) belongs to A'. 
Hence AC A'. 
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To prove the reversed inclusion, let (Pr, Sp 1; Sei, Sp 2 , Se 2 ) G A'. Put n 
r , 7r 1 ) := (1 — Pr, Pr). Choose two numbers 



Xoo|o 
Xoo|i 



G [(S Pl + Sp 2 -l) + ,S Pl ASp 2 ] 

G [(1-S ei -Se 2 ) + ,(1-S ei ) A(l-Se 2 )] 



such that 



(1 -Pr)x 00 |o + Pr Aoo|i 



This is possible by connectedness, since the two intervals above are nonempty 
and we would get "< g 00 " by choosing the lower endpoints and "> g 00 " for 
the upper ones. Now put 



Xoi|o 
Aio|o 
Xn\o 



SPi — Xoo|o 

Sp 2 ~ Aoo|o 
1 - Spi - Sp 2 



Aoo|o 



Aoi|i 
Aio|i 
An|i 



1 - Sex - xoon 
1 - Sea - Xoon 
Sei + Se 2 - 1 + x 00 |i 



Then (tt,x) G 6 2 satisfies the equations (39)-(41), so that //(i?) = q, and 
the corresponding element of A is (Pr, Sp 1; Sei, Sp 2 , Se 2 ). Hence we also have 
A' C A. 

Obviously, equations (23) and (24) are jointly equivalent to (27) and (28), 
by addition and subtraction. In the presence of (23) and (24), we have 

(1 - Pr) S Pl A Sp 2 = 



Pr 
Pr 



(l-Pr)(l-S Pl )V(l-Sp 2 ) 
(g 1+ -PrSei) V( 9+1 -PrSe 2 ) 



so that, by inserting and rearranging, inequality (25) is equivalent to 

Pr Sei V Se 2 + (q 10 - Pr Sei) V (q 01 - Pr Se 2 ) < q 01 + q w 

which, by considering separately the four cases aVfe + cVrf = a + c etc., 
simplifies to (29). Finally, in the presence of (29), inequality (26) is equivalent 
t° (%o ~ Qn ~~ x ) + + x+ — %o w ^h x := Pr (1 — Sei — Se 2 ), which simplifies 
to — q u < x < q QQ , that is, (30). □ 



Proof of Lemma 12.41 Call B' the set claimed to equal B. By Lem- 



ma 



2.3, we have (Pr, Sei, Se 2 ) G B iff there exist Sp l5 Sp 2 with the quintuple 



(Pr, Spi, Sei, Sp 2 , Se 2 ) G [0, l] 5 satisfying pg-Q or, equivalently, (|27|)-Q 
So in this case, we have in particular (29) and (30), while (23) and (24) 
together with Spi,Sp 2 G [0,1] yield (31) and (32). 



Conversely, if (Pr, Sei,Se2) G B\ then by (31) and (32) we can find 
Spi,Sp 2 G [0,1] satisfying (23) and (24), and hence (27) and (28), and thus 
(Pr, Sei, Se 2 ) G B. □ 
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Proof of Lemma [231 Call C the set claimed to equal C. By Lem- 



ma 



2.4, we have (Pr, ASe) G C iff there exist Sei,Se 2 with (Pr, Sei,Se 2 ) G 



[0, l] 3 satisfying @, Q, Q, (|32|), and ASe = Se 2 - Sei. 

Let (Pr, ASe) G C and let Sei,Se2 be as just stated. Then (31) and (32) 
yield 

(42) Pr ASe = Pr Se 2 - Pr Sex < q +l - (Pr - q 0+ ) = q m - q 10 + 1 - Pr 
and similarly 

Pr ASe > Pr - q +0 - q 1+ = q 01 - q l0 + Pr - 1 



Together with (29), the above yields (33). 

Conversely, let (Pr, ASe) G C . If Pr = 0, then we may put Sei := (ASe) 
and Se 2 := (ASe) + , and observe that (Pr, Sei,Se 2 ) then satisfies (29), ([30 
d3T|, & and ASe = Se 2 - Sei. 



So assume Pr > 0. By connectedness we can choose 

Se. e [(i-£) V 0,£ai] =: 

Se 2 £ fl-^VO.^Al =: [a 2 M 



in such a way that Se 2 — Sei = ASe, since the two intervals above are 
nonempty and since taking Sei = «i and Se 2 = b 2 would yield 



Pr (Se 2 - Sei) 



= q +1 A Pr - (Pr - q 0+ ) V 

= min{g +1 - Pr + q 0+ , q +1 , g 0+ , Pr} 

> min{g 01 - q w + 1 - Pr , q 01 , Pr ASe} 

= PrASe 



using (33) in the last step, while Sei = &i and Se 2 = a 2 would similarly yield 

Pr (Se 2 - Sei) = (Pr - q +0 ) V — q 1+ A Pr 

= max{Pr - q +0 - q l+ , -g +0 , -q l+ , -Pr} 
< max{g 01 - q 10 + Pr - 1 , -q 10 , Pr ASe} 
= PrASe 

using again ((331 in the last step. For every choice of Sei and Se 2 as above, 



the triple (Pr, Sei,Se 2 ) obviously satisfies (29), (31), and (32). To get (30) as 
well, we have to refine our choice: Since the condition Se 2 — Sei = ASe is not 
affected by a same translation of Sei and Se 2 , we could choose Sei and Se 2 
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such that Sej = ctj for some i, which always yields Pr (Sei + Se 2 — 1) < q u as 
in the case of i — 1: 

Pr (Sei + Se 2 - 1) < (Pr — q 0+ ) V + q +l A Pr — Pr 

= max{— q 0+ + q +1 A Pr , q +1 A Pr — Pr} 

< max{g +1 - g 0+ , 0} 

< In 

Alternatively we could choose Sei and Se 2 such that Se^ = b{ for some i, 
yielding Pr (Sei + Se 2 — 1) > — g 00 . By connectedness, then, we can choose Sei 
and Se 2 such that ( pOj ) holds. Then (Pr, Sei, Se 2 ) G D and we get (Pr, ASe) = 
(Pr,Se 2 -Sei) e C. □ 



2.1 



D. Lemma 2.2 
equals 



Proof of Lemma 12.61 Call D' the set claimed to equa 
yields D = {(iri,Xx\i) '■ ( 7r > X) G ^r 1 (?-+)}> wnich by Lemma 

{(Pr,Sei) G [0,1] 2 : 3Spi G [0,1] with (1 - Pr)(l - Spi) + PrSei = q 1+ } 

Thus, if (Pr, Sei) £ D and if Sp x is chosen according to the above, then using 



Spi > and Sp x < 1 yields (31) and hence (Pr, Sei) £ D' . Conversely, if 
(Pr,Sei) G D', then Sp x := 1 - (q 1+ -PrSe x )/(l -Pr) G [0,1], even if Pr = 1 
using 0/0 := 0, hence (Pr, Sei) e D. □ 

Proof of Lemma 12.71 As above for Lemma l2~6l □ 



Proof of Lemma 12.81 In each case, the "C" claim is trivially true. To 



prove "D", use Lemmas 2.5, 2.6, 2.7 with Pr 
with Sei = for I. 



for F, 67, H, and Lemma 2.6 



□ 



Proof of Lemma [2791 Call B' K the set claimed to equal B<. By Lem- 



ma 



2.3, we have (Pr, Sei, Se 2 ) G B< iff there exist Sp 1; Sp 2 with the quintuple 
(Pr, Sp 1; Sei, Sp 2 , Se 2 ) G [0, l] 5 satisfying or equivalently 

and additionally 



(43) 



Spi < Sp 2 



Let (Pr,Sei, Se 2 ) G B<. Then Lemma 2.4 and B< C B yield (29)-(32), and 



using (p7J) and (43), we can sharpen (29) to (34), so that (Pr, Sei,Se 2 ) G -B<. 



Conversely, let (Pr, Sei,Se 2 ) G -B<. Then, since (34) implies (29), Lem 



ma [274 yields (Pr, Sei,Se 2 ) G B, so that there exist Sp 1 ,Sp 2 G [0,1] such 
that (Pr, Sp l5 Sei, Sp 2 , Se 2 ) G A. If Pr = 1, then by Lemma 2.3 we can choose 
e.g. Spi 



Sp 2 



since (|27h-( 30 ) remain unaffected, and hence get (43) 



If Pr < 1, then (27) and the left hand inequality in (34) yield (43). Thus 
(Pr, Sei, Se 2 ) G B<. ' □ 
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ma 
[0,1 



Proof of Lemma 12.101 Call C< the set claimed to equal C<. By Lem- 

iff there exist Sei, Se 2 with (Pr, Se 1; Se 2 ) G 
and ASe 



2^9] we have (Pr, ASe) G C< 



satisfying (30), (31 ) 



Se 2 — Sei. 



Let (Pr, ASe) G C< and let Sei,Se2 be as just stated. Then (31) and (32) 



yield (42), and together with (34) this yields (35), hence (Pr, ASe) G C<. 



2.5 



Conversely, let (Pr, ASe) G C<. Then, since (35) implies (33), Lemma 
yields (Pr, ASe) G C, so that there exist Sei, Se 2 with (Pr, Sei, Se 2 ) G B and 
ASe = Se 2 — Sei. Now the left hand inequality in (35) yields q 01 — q w 
Pr (Se 2 — Sei), which together with (29) yields (34). Hence, by Lemma 2.9 
we have (Pr, Sei, Se 2 ) G B< and thus (Pr, ASe) = (Pr, Se 2 — Sei) G C<. □ 



< 



ma 



Proof of Lemma 12.111 Call D' K the set claimed to equal D<. By Lem- 
2.9, we have (Pr, Sei) G D< iff there exists Se 2 with (Pr, Sei, Se 2 ) G [0, l] 3 
(@, and Q. 

G D< and let Se 2 be as just stated. Then (30) and (34) yield 



satisfying (30) 
Let (Pr, Sei 



PrSei 



* (Pr + Pr (Sei + Se 2 - 1) - Pr (Se 2 - Sei) 

Pr + q 1+ - q m 



2 

< -(Pr + g n - (q Q1 - q 1Q 



and (34) and Se 2 < 1 yield Pr Sei = ? r Se 2 — Pr (Se 2 — Sei) < Pr + q w — q 01 . 



Combined with (31), we get (36). Hence (Pr, Sei) G -D<. 

Conversely, let (Pr, Sei) G D' K . Then, since (36) implies (31), Lemma 



2.6 



yields (Pr, Sei) G D, so that there exists Se 2 with (Pr, Sei,Se 2 ) G B. By 
Lemma 2.4| this is equivalent to Se 2 fulfilling the conditions Se 2 G [0, 1] 



and (29)-(32), and we may assume that Se 2 has been chosen maximal with 



this property. Then at least one of the following four cases occurs, with each 
leading via (36 ) or trivially to Pr (Se 2 — Sei' 
to (34), proving (Pr, Sei) G D< as desired: 
1. Then Pr (Se 2 -Sei) 



Case 1: Se 2 



> q 01 — q w and hence, using (29), 
Pr — PrSei > Pr — (Pr + g 10 — g 01 ) = 



%i ~~ Qw Case 2: Equality holds on the right in (29). Case 3: Equality holds 



on the right in (30). Then 



Pr (Se 2 -Sei) 



> 



Pr 
Pr 



1) -2PrSei 
- %i) = %i ~ 

Case 4: Equality holds on the right in (32). Then Pr (Se 2 — Sei) 



Pr (Sei + Se 2 - 
q n - (Pr + g x _ 



Qw 



Pr Se 9 



□ 



PrSei > q +1 -q 1+ = q 01 - q 10 . 

Proof of Lemma (2. 12L Call E< the set claimed to equal E<. By Lem- 



ma 



2.9, we have (Pr, Se 2 ) G E< iff there exists Sei with (Pr, Sei, Se 2 ) G [0, l] 3 
satisfying pOl, pll, ([32l, and pi. 
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Let (Pr, Se 2 ) G E< and let Sei be as just stated. Then (30) and (34) yield 



Pr Se, 



Pr + Pr (Sei + Se 2 - 1) 



Pr(Se 2 -S ei ) 
Pr + g 01 - q 



+o 



and (34) and PrSei > yield PrSe 2 = Pr (Se 2 — Sei) + PrSei > g 01 — q w . 



2.7 



Combined with (32), we get (37). Hence (Pr, Sei) G -D<- 

Conversely, let (Pr, Se 2 ) G £7<. Then, since ( p7| ) implies (32), Lemma 
yields (Pr, Se 2 ) G £J, so that there exists Sei with (Pr, Sei,Se 2 ) G B. By 
Lemma 2.4, this is equivalent to Sei fulfilling the conditions Sei G [0,1] 



and (29)-(|32[), and we may assume that Sei has been chosen minimal with 
this property. Then at least one of the following four cases occurs, with each 
leading via (37) or trivially to Pr (Se 2 — Sei) > q 01 — q 10 and hence, using (29), 



to (34), proving (Pr, Se 2 ) G E< as desired: 

Case 1: Sei = 0. Then Pr (Se 2 — Sei) = Pr Se 2 > q Q1 — q w . Case 2: Equality 



holds on the right in (29). Case 3: Equality holds on the left in (30). Then 



Pr(Se 2 -Sei) = -Pr - Pr (Sei + Se 2 - 1) + 2 Pr Se 2 
> -Pr + g 00 + (Pr + q Q1 - q +0 ) = q Q1 

Case 4: Equality holds on the left in (31). Then Pr (Se 2 — Sei) 
Pr Sei > (Pr - q +0 ) - (Pr - q 0+ ) = q 01 - q w . 



Qw 

= Pr Se 2 



□ 



Proof of Lemma 12.131 Call F< the interval claimed to equal F<. By 



Lemma 2.10, we have ASe G F< iff there exists Pr with (Pr, ASe) G [0, 1] x 
[—1,1] satisfying (35). 



Let ASe G F< and let Pr be as just stated. If q 01 —q w > 0, then (35) yields 
Pr > and hence ASe > (g 01 — g 10 )/Pr > q Q1 — g 10 . Hence always ASe G F<. 
Conversely, let ASe G F<. If q 01 — q w > 0, then Pr := (g 01 — g 10 )/ASe G 



]0, 1] satisfies (35). If q 01 — q w < 0, then Pr : = 
F<. 



satisfies (35). Hence ASe G 

□ 



Proof of Lemma [2.141 Call 67< the interval claimed to equal G<. By 
we have Sei £ G< iff there exists Pr with (Pr, Sei) G [0, l] 2 



Lemma 2.11 



satisfying (36). 



If q 01 — q 10 < 0, then q +1 — q w < q u and hence G'< = [0, 1]; and given 
Sei G [0, 1], we may put Pr := to satisfy (36), so that also G< = [0, 1]. 



So let q Ql — q w > for the rest of this proof. The three functions fa 
]0, 1] — )■ R defined by 



fi(x) 



X 



2 



Qoi 



2x 



22 



L. MATTNER AND F. MATTNER, JANUARY 19, 2012 



are continuous and monotone with lim x ^ Q f 3 (x) = — oo, so that their point- 
wise infimum / := fi A / 2 A / 3 attains its maximal value at Pr^ := 1 or at 
some x G ]0, 1] satisfying fi(x) = fj(x) with i < j. The latter three equations 
have the unique solutions Pri 2 := q 1+ + q 01 , Pri 3 := q +1 , Pr 2 3 := q +1 — q w , 
each strictly positive by q 01 — q w > 0, and we get 

/(Pri) = Qi+ A — ' ^ ^ ul A(l + g 10 -g 01 ) = g 1+ 



/(Pr 12 ) 
/(Pri 3 ) 



/(Pr 23 ) = ^^A 



9i+A 








?i+ + 






G 






9+i - 


?10 




?1+ 



?i+ + Qoi 
+ Q11 



2g + i 

gii 



We have /(Pri) < /(Pri 2 ) since g 1+ + g 01 < 1. Writing here a ~ 6 to indicate 
that a6>0ora = 6 = holds, clearing fractions yields 

/(Pria) - /(Pria) ~ 2 - + g 01 )(g 1+ + g u ) = q 01 (q 01 - q 1+ ) 

/(Pr 23 ) - /(Pria) ~ 2 g +1 g n - (q +1 - q 10 )(q 1+ + q n ) = q Q1 (q 1+ - q 01 ) 

Hence max / = /(Pr 12 ) V /(Pr 23 ), namely attained at Pr i2 if q 01 > g 1+ and 
at Pr 23 if q 01 < q 1+ . 

Now let Sei G G< and let Pr G [0, 1] with (36). Then in particular Pr Sei < 
Pr + q w — q m and thus Pr > 0, using q 01 — q 10 > 0. Thus (36) yields Sei < 
/(Pr) < max / and thus Sei G Cr<. 

Conversely, let Sei G £?<. Then Sei < max / and thus, by lim^Q f(x) = 
—00 and continuity of /, there is a Pr G ]0, 1] with /(Pr) = Sei, yielding 
PrSei = Pr/(Pr) = R.H.S.Q > L.H.Sjg), with the last inequality due 
to Pr < 1 and q G prob({0, l} 2 ), so that (|36[) holds and hence Sei £ G<- □ 



Proof of Lemma 12. 151 Very similar to the above proof of Lemma |2. 14 



with the following differences: Use Lemma 2.12 in place of Lemma 2.11 After 
again restricting attention to the main case where q 0l — q 10 > 0, define now 

h(x) := 1 - ?±fi h{x) := \ + q °^ q ^ f 3 (x) := q -^^ 
x 2 2x x 

and observe that / := f\ V / 2 V / 3 is minimized over ]0, 1] at one of Pri := 1, 

Pri2 ■= q +0 + q i, Pri3 '■=%+, Pr 2 3 ■= Qo+ - Qw Af ter computing /(Pri) = 

Q +1 , /(Pr 12 ) = /(Pr 13 ) = 2 f^ ) /(Pr 23 ) = J^, one observes 

/(Pri) > /(Pr 12 ) and 

/(Pr 12 )-/(Pr 13 ) ~ gio(<Z+o-<Zoi) ~ /(Pri 3 ) - /(Pr 23 ) 
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Hence min/ = /(Pr 12 ) A /(Pr 23 ) 



□ 



Proof of Lemma 12.161 Call I< the interval claimed to equal J<. By 
Lemma [2. 10, we have Pr G J< iff there exists ASe with (Pr, ASe) G [0, 1] x 



[—1,1] satisfying (35). 

Let Pr G J< and let ASe be as just stated. If q 01 — q w = 0, then I< = [0, 1] 
and hence Pr G /<. If <7oi — q w > 0, then (35) implies ASe > and hence Pr > 
(g 01 — g 10 )/ASe > q i~ q w and thus again Pr G /<. If q 01 — q w < 0, then either 
Qoi ~ Qio + 1 ~ Pr > and then Pr < 1 - (<?i - <?oi) + > or q 01 - q w + 1 - Pr < 



and then (35) yields ASe < and thus Pr < (g 01 — g 10 )/ASe < q w — q 01 , so 
that Pr G /< also in this case. 

Conversely, let Pr G /<. If q 01 — q w < 0, then either q m — q 10 + 1 — Pr > 
and then ASe := satisfies (35), or q 01 — q w + 1 — Pr < and then Pr G J< 
yields Pr < q 10 — q m so that ASe := — 1 satisfies (35). If q 01 — q w > 0, then 
ASe := (g 01 — g 10 )/Pr G ]0, 1] satisfies (35). Hence Pr G J< in every case. □ 



4. The remaining proofs. Below we prove Theorems 1.1 and Theo- 



rem 1.2 by applying the rather general and trivial Lemmas 4.1 and 4.2 to- 
gether with the special Lemmas 2.3, 2.10, and 2.12 We then deduce Theo- 
rem [L2] from Theorem 11.31 

Lemma 4.1. Let V = (P# : $ G 6) and Q = (Q v : rj G H) be experiments 
on the same sample space X , with parameters of interest k : — > M. and 
A : H — > M. Let f3 G [0, 1]. 

A. Assume the implication 



(44) 



i|GH =4> 3^ G with R d = Q v and «(??) < X(r]) 



Then every lower (3-confidence bound for (V, k) is also one for (Q,X). 
B. Assume the implication 



(45) 



7] G H 3i? G with P,, = Q v and > A(r/) 



and let k and k be both lower /3-confidence bounds for (V, k) and for (Q,X), 
with k worse than k for (V, k). Then k is also worse than k for (Q,X). 
Analogously for upper confidence bounds, with "< " and "> " interchanged. 

Proof. A. Let k be a lower /3-confidence bound for (V, k) and let i|GH. 



With $ from Q then Q v (k < X(rj)) = P#(k < A(r/)) > P#(k < «(#)) > (3. 

B. Let 7] G H and t < X(rj). With •& from (45) we then have t < and 
hence Q v (k >t) = P#(k > t) < P#(k > t) = Q v (k > t). □ 
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Below, a natural exponential family, or NEF for short, is any statistical 
model Q = (Q v : i] G H) such that, for some fcGN and some measure v on 
M fc , we have HCR fc and, for each 77 G H, is a law on IR fc with a //-density 
proportional to y 1— >■ exp(X)f =1 77^). 

LEMMA 4.2. Let Q = (Q v : j) 6 H) k a iVE!F wzt/i H open and nonempty. 
Let A : H — > M. be lower semicontinuous and let A and A be equivalent lower 

confidence bounds for (Q, A). Then A A sup A(H) = A A sup A(H) Q-a.s. 



Proof. The equivalence assumption yields 
(46) qJ\ >t) = Q v (\>t) if r] G H and t G [-oo, A(r?)[ 



For fixed t G R with t < sup A(H), the subfamily (Q v : 77 G H, A(r?) > t) is 
again a NEF with nonempty open parameter space, hence complete in the 
sense of Lehmann-Scheffe, so that (46) yields {A > t} = {A > t} Q-a.s. Hence 



{A A sup A(H) ^ A A sup A(H)} 



U 



{A < t < A} U {A < t < A} 



teQ, <<supA(H) 



is a Q-null set. 



□ 



Proof of Theorem 11.11 We first check the applicability of Lemma [4~T 
to some pairs of estimation problems. Recall /1 and P$ from ^ and (|4]). 

The problems (A4, (Jsj) ) and (P 2 , ([9])), in this order but also in the reversed 
one, fulfill the assumptions of Lemma |4.1| A and B: For the stated order, 
given = (71", x) ©2, put q := G prob({0, l} 2 ), and observe that then 
M„ „ = and R.H.S.(|8l) = R.H.S.Q by Lemma [23p7]) . For the reversed 



order, given q G prob({0, 1} ), choose $ G 02 with /x(t9) = g using the 
nonemptyness claim of Lemma [2.3[ and finish as in the preceding sentence. 
The problems (P2, @) and (7^2,< ? ( 10 )) fulfill the assumptions of Lem- 
A, since for $ G O2X, we also have $ G O2 and R.H.S.([9]) < 



ma 



4.1 



R.H.S.(IO) 



ma 
7? = 



The problems (V 2 ,<, (0) and (M, 
J1\A and B: Given q G prob({0, l} 2 
02,< with /i(7?) = g, SO Pg = 



(tt,%) G 



fulfill the assumptions of Lem- 

1 yields a 
RH.S.(|8l). 



, Lemma 2.10 with Pr 



and R.H.S.(IO) 



Applying now Lemma |4.1| several times yields parts A and B of the theo- 
rem. The subclaim of Part C referring only to (Ai, ^) and (P 2 , ([9])) follows 
directly from Parts A and B, as "A strictly worse than A" is equivalent to 

"A worse than A, and not A worse than A". 
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Finally, let A be admissible as a lower /3-confidence bound for (Ai, (j8j) 
By Part A, A is also a /3-confidence bound for (7 ? 2,<, (10)). Let A be a better 



/3-confidence bound for (J- > 2,<i (10 )). We have to show that A is equivalent 



to A for (7\<, (10)). By Part B, A is better than A also for (M, ®) and 



hence, by the assumed admissibility, in fact equivalent to A for (yVf, (|5 
With a view towards applying Lemma 4.2, we put H := {t] G ] — oo,0[ 3 



J2i=iQ Vi < 1}, define a function r : H ->■ prob({0, 1} ) by r 00 (?7) := e^i, 

Toi(v) '■= eV2 i T io(v) '■= e??E S an( l T ii{v) '■= 1 ~~ Y^=i^ Vi for i| 6 H, and put 
Q:= Mot, that is, Q = (Q v : f] 6 H) with 



M, 



Let k denote the 



',r\ xv J-n,r(»?) • 

function ([§]) and let A := hot so that, writing Prob(A") for the set of all laws 

r> 2 



on X := {k G N^' 1 : A; H 



n}, the diagram 
H 




Prob(A') 



commutes. Then, trivially, A := A and A := A are equivalent lower confidence 
bounds for (Q, A). Now Lemma 



4.2 



applies and yields A A 1 = A A 1 every- 



where on X and hence, as (10) is [— 1, l]-valued, the wanted equivalence. □ 



Proof of Theorem OJ The problems (M, (13)) and (V 2 ,<, (14)) fulfill 



the assumptions of the "upper" version of Lemma 4.1 A, since for $ G 02,< and 
q := /i($), we have R.H.S.(Jl~3|) > R.H.S.(14) by Lemma 2.14 The problems 



(7 ? 2,<,(14)) and (.M,(13)) fulfill the assumptions of the "upper" version of 
Lemma 4.1 A and B, since for q G prob({0, l} 2 ), Lemma 2.14 yields a d G 
2i < with //(tf) = q and R.H.S.(|li|) = R.H.S.Q. Hence Lemma fO] yields 
parts A and B of the theorem. 

To pro ye P art C, we can proceed as in the last paragraph of our proof of 
Theorem LI with the following chan ges: Given now S and S, we let n denote 
the function (13). Then Lemma 4.2 applies with A := — S and A := —S to 



yield S'VO = S\/0. Here —nor] is indeed lower semicontinuous, but one could 
also replace H by {rj G H : rj 2 > r] 3 }] then —nor] would be continuous. □ 



Proof of Theorem I1.31 The function ( |i~6| ) is an upper /3-confidence 
bound in the quadrinomial model Ai := (M ni9 : q G prob({0, l} 2 )) and for the 
parameter (13) from Theorem 1.2, since for q G prob({0, l} 2 ), conditioning 
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on the upper left corner of our 2x2 table yields 

M n , g ({* G f4°' 1}2 : k ++ = n, u(k w , k 01 , k n ) > R.H.S.Q} 



= E - m) M m , p ({j G nJ 1 ' 2 ' 31 : j + = m, u(j) > R.H.S.(|T5[}) 

m=0 

> P 

with b n g oo denoting a binomial density, and with p G prob({l, 2, 3}) defined 
by p := (1 - g 00 ) _1 (g 10 , q Q1 , q n ) if g 00 < 1, and p : = (0, 0, 1) if q 00 = 1. Hence 
the claim follows from Theorem [L2l A. □ 
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